Overview
Dataset statistics
| Number of variables | 24 |
|---|---|
| Number of observations | 51290 |
| Missing cells | 41296 |
| Missing cells (%) | 3.4% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 9.4 MiB |
| Average record size in memory | 192.0 B |
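The derived figures in this table can be checked from the raw counts. The following is a minimal sketch of that arithmetic, using only numbers reported above; the variable names are illustrative and not part of the report.

```python
# Figures copied from the dataset statistics table above.
n_variables = 24
n_observations = 51290
missing_cells = 41296
total_size_mib = 9.4  # rounded value as reported

# The missing-cell share is taken over all cells (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average record size: total memory divided by the number of rows.
# Because 9.4 MiB is itself rounded, this only approximates 192.0 B.
avg_record_bytes = total_size_mib * 1024 * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Note that all 41296 missing cells are accounted for by the single Postal Code column reported in the alerts.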
Variable types
| Numeric | 7 |
|---|---|
| Text | 8 |
| DateTime | 2 |
| Categorical | 7 |
| Alert | Type |
|---|---|
| Category is highly overall correlated with Sub-Category | High correlation |
| Discount is highly overall correlated with Profit | High correlation |
| Market is highly overall correlated with Postal Code and 2 other fields | High correlation |
| Postal Code is highly overall correlated with Market and 1 other field | High correlation |
| Profit is highly overall correlated with Discount | High correlation |
| Region is highly overall correlated with Market and 2 other fields | High correlation |
| Row ID is highly overall correlated with Market and 1 other field | High correlation |
| Sales is highly overall correlated with Shipping Cost | High correlation |
| Shipping Cost is highly overall correlated with Sales | High correlation |
| Sub-Category is highly overall correlated with Category | High correlation |
| Postal Code has 41296 (80.5%) missing values | Missing |
| Row ID is uniformly distributed | Uniform |
| Row ID has unique values | Unique |
| Discount has 29009 (56.6%) zeros | Zeros |
| Profit has 668 (1.3%) zeros | Zeros |
Reproduction
| Analysis started | 2025-09-25 19:00:10.680235 |
|---|---|
| Analysis finished | 2025-09-25 19:00:18.952603 |
| Duration | 8.27 seconds |
| Software version | ydata-profiling v4.16.1 |
Variables
Row ID
Real number (ℝ)
High correlation  Uniform  Unique 
| Distinct | 51290 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25645.5 |
| Minimum | 1 |
| Maximum | 51290 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 400.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2565.45 |
| Q1 | 12823.25 |
| median | 25645.5 |
| Q3 | 38467.75 |
| 95-th percentile | 48725.55 |
| Maximum | 51290 |
| Range | 51289 |
| Interquartile range (IQR) | 25644.5 |
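Since Row ID takes each integer from 1 to 51290 exactly once, the quantiles above can be reproduced by hand. The sketch below assumes linear interpolation between neighbouring ranks, which matches the reported values; `quantile_linear` is a hypothetical helper written for this illustration, not a function of the profiling library.

```python
def quantile_linear(sorted_values, q):
    """Quantile with linear interpolation between the two nearest ranks."""
    position = q * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


# Row ID covers 1..51290 with no gaps or repeats (see the table above).
row_ids = list(range(1, 51291))

p5 = quantile_linear(row_ids, 0.05)
q1 = quantile_linear(row_ids, 0.25)
median = quantile_linear(row_ids, 0.50)
q3 = quantile_linear(row_ids, 0.75)
p95 = quantile_linear(row_ids, 0.95)
iqr = q3 - q1
value_range = row_ids[-1] - row_ids[0]

for name, value in [("5%", p5), ("Q1", q1), ("median", median),
                    ("Q3", q3), ("95%", p95), ("IQR", iqr), ("range", value_range)]:
    print(f"{name}: {value:.2f}")
```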
Descriptive statistics
| Standard deviation | 14806.292 |
|---|---|
| Coefficient of variation (CV) | 0.57734464 |
| Kurtosis | -1.2 |
| Mean | 25645.5 |
| Median Absolute Deviation (MAD) | 12822.5 |
| Skewness | -6.0069466 × 10⁻¹⁸ |
| Sum | 1.3153577 × 10⁹ |
| Variance | 2.1922628 × 10⁸ |
| Monotonicity | Not monotonic |
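The descriptive statistics above are consistent with a column holding the integers 1 to 51290. A minimal sketch using Python's standard `statistics` module is given here as a cross-check; it assumes the report uses the sample (n - 1) variance and defines MAD as the median of absolute deviations from the median, both of which reproduce the reported numbers.

```python
import statistics

# Row ID covers 1..51290 with no gaps or repeats.
row_ids = list(range(1, 51291))

total = sum(row_ids)
mean = statistics.mean(row_ids)
variance = statistics.variance(row_ids)  # sample variance (n - 1 denominator)
std_dev = statistics.stdev(row_ids)      # square root of the sample variance
cv = std_dev / mean                      # coefficient of variation

# Median absolute deviation: median of |x - median(x)|.
med = statistics.median(row_ids)
mad = statistics.median(abs(x - med) for x in row_ids)

print(f"sum={total}, mean={mean}, variance={variance}")
print(f"std={std_dev:.3f}, cv={cv:.8f}, mad={mad}")
```

The reported skewness of about -6 × 10⁻¹⁸ is floating-point noise around zero, as expected for a symmetric distribution.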
| Value | Count | Frequency (%) |
| 6147 | 1 | < 0.1% |
| 32298 | 1 | < 0.1% |
| 26341 | 1 | < 0.1% |
| 25330 | 1 | < 0.1% |
| 13524 | 1 | < 0.1% |
| 47221 | 1 | < 0.1% |
| 22732 | 1 | < 0.1% |
| 30570 | 1 | < 0.1% |
| 31192 | 1 | < 0.1% |
| 40155 | 1 | < 0.1% |
| Other values (51280) | 51280 |
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 1 | |
| 7 | 1 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 |
| Value | Count | Frequency (%) |
| 51290 | 1 | |
| 51289 | 1 | |
| 51288 | 1 | |
| 51287 | 1 | |
| 51286 | 1 | |
| 51285 | 1 | |
| 51284 | 1 | |
| 51283 | 1 | |
| 51282 | 1 | |
| 51281 | 1 |
Order ID
Text
| Distinct | 25035 |
|---|---|
| Distinct (%) | 48.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 400.8 KiB |
Length
| Max length | 15 |
|---|---|
| Median length | 14 |
| Mean length | 13.569643 |
| Min length | 9 |
Unique
| Unique | 12257 |
|---|---|
| Unique (%) | 23.9% |
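Distinct and Unique measure different things here: Distinct is the number of different values, while Unique is the number of values that occur exactly once (which is why columns such as Ship Mode report 4 distinct values but 0 unique ones). A small sketch of the distinction follows; the row arrangement is hypothetical and only reuses IDs from the sample for readability.

```python
from collections import Counter

# Hypothetical rows: one order can span several line items,
# so the same Order ID may repeat.
order_ids = [
    "CA-2012-124891",
    "IN-2013-77878",
    "IN-2013-77878",
    "SG-2013-4320",
    "CA-2012-124891",
    "ES-2013-1579342",
]

counts = Counter(order_ids)

# Distinct: how many different values appear at all.
distinct = len(counts)

# Unique: how many values appear exactly once.
unique = sum(1 for n in counts.values() if n == 1)

print(f"distinct={distinct}, unique={unique}")
```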
Sample
| 1st row | CA-2012-124891 |
|---|---|
| 2nd row | IN-2013-77878 |
| 3rd row | IN-2013-71249 |
| 4th row | ES-2013-1579342 |
| 5th row | SG-2013-4320 |
| Value | Count | Frequency (%) |
| ca-2014-100111 | 14 | < 0.1% |
| to-2014-9950 | 13 | < 0.1% |
| ni-2014-8880 | 13 | < 0.1% |
| in-2012-41261 | 13 | < 0.1% |
| in-2013-42311 | 13 | < 0.1% |
| mx-2014-166541 | 13 | < 0.1% |
| in-2011-76625 | 12 | < 0.1% |
| mx-2013-142678 | 12 | < 0.1% |
| in-2014-15263 | 12 | < 0.1% |
| mx-2013-127705 | 12 | < 0.1% |
| Other values (25025) | 51163 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 108762 | |
| - | 102580 | |
| 2 | 90147 | |
| 0 | 85011 | |
| 4 | 45213 | 6.5% |
| 3 | 41592 | 6.0% |
| 5 | 27294 | 3.9% |
| 6 | 25799 | 3.7% |
| 7 | 23086 | 3.3% |
| 8 | 22626 | 3.3% |
| Other values (27) | 123877 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 490827 | |
| Dash Punctuation | 102580 | 14.7% |
| Uppercase Letter | 102580 | 14.7% |
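The category labels above correspond to Unicode general categories. As an illustration, the sketch below counts them for the five sample Order IDs using the standard `unicodedata` module; the `CATEGORY_NAMES` mapping is an assumption made for this example and covers only the three categories seen in this column.

```python
import unicodedata
from collections import Counter

# The five sample rows listed for Order ID.
samples = [
    "CA-2012-124891",
    "IN-2013-77878",
    "IN-2013-71249",
    "ES-2013-1579342",
    "SG-2013-4320",
]

# Assumed mapping from Unicode category codes to the labels in the report.
CATEGORY_NAMES = {
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Pd": "Dash Punctuation",
}

category_counts = Counter(
    CATEGORY_NAMES.get(unicodedata.category(ch), unicodedata.category(ch))
    for value in samples
    for ch in value
)

for name, count in category_counts.most_common():
    print(f"{name}: {count}")
```

Each ID has a two-letter prefix and two dashes, which explains why the full column reports the same count (102580 = 2 × 51290) for Uppercase Letter and Dash Punctuation.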
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 15332 | |
| S | 13869 | |
| A | 9679 | |
| C | 9196 | |
| N | 8691 | |
| E | 8590 | |
| M | 8499 | |
| X | 7644 | |
| U | 6508 | |
| T | 3748 | 3.7% |
| Other values (16) | 10824 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 108762 | |
| 2 | 90147 | |
| 0 | 85011 | |
| 4 | 45213 | |
| 3 | 41592 | 8.5% |
| 5 | 27294 | 5.6% |
| 6 | 25799 | 5.3% |
| 7 | 23086 | 4.7% |
| 8 | 22626 | 4.6% |
| 9 | 21297 | 4.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 102580 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 593407 | |
| Latin | 102580 | 14.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 15332 | |
| S | 13869 | |
| A | 9679 | |
| C | 9196 | |
| N | 8691 | |
| E | 8590 | |
| M | 8499 | |
| X | 7644 | |
| U | 6508 | |
| T | 3748 | 3.7% |
| Other values (16) | 10824 |
Common
| Value | Count | Frequency (%) |
| 1 | 108762 | |
| - | 102580 | |
| 2 | 90147 | |
| 0 | 85011 | |
| 4 | 45213 | |
| 3 | 41592 | 7.0% |
| 5 | 27294 | 4.6% |
| 6 | 25799 | 4.3% |
| 7 | 23086 | 3.9% |
| 8 | 22626 | 3.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 695987 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 108762 | |
| - | 102580 | |
| 2 | 90147 | |
| 0 | 85011 | |
| 4 | 45213 | 6.5% |
| 3 | 41592 | 6.0% |
| 5 | 27294 | 3.9% |
| 6 | 25799 | 3.7% |
| 7 | 23086 | 3.3% |
| 8 | 22626 | 3.3% |
| Other values (27) | 123877 |
Order Date
Date
| Distinct | 1430 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 400.8 KiB |
| Minimum | 2011-01-01 00:00:00 |
| Maximum | 2014-12-31 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
Ship Date
Date
| Distinct | 1464 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 400.8 KiB |
| Minimum | 2011-01-02 00:00:00 |
| Maximum | 2015-07-01 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
Ship Mode
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 400.8 KiB |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 12.843069 |
| Min length | 8 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Same Day |
|---|---|
| 2nd row | Second Class |
| 3rd row | First Class |
| 4th row | First Class |
| 5th row | Same Day |
Common Values
| Value | Count | Frequency (%) |
| Standard Class | 30775 | |
| Second Class | 10309 | 20.1% |
| First Class | 7505 | 14.6% |
| Same Day | 2701 | 5.3% |
Words
| Value | Count | Frequency (%) |
| class | 48589 | |
| standard | 30775 | |
| second | 10309 | 10.0% |
| first | 7505 | 7.3% |
| same | 2701 | 2.6% |
| day | 2701 | 2.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 115541 | |
| s | 104683 | |
| d | 71859 | |
| (space) | 51290 | |
| C | 48589 | |
| l | 48589 | |
| S | 43785 | 6.6% |
| n | 41084 | 6.2% |
| t | 38280 | 5.8% |
| r | 38280 | 5.8% |
| Other values (8) | 56741 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 504851 | |
| Uppercase Letter | 102580 | 15.6% |
| Space Separator | 51290 | 7.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 115541 | |
| s | 104683 | |
| d | 71859 | |
| l | 48589 | |
| n | 41084 | 8.1% |
| t | 38280 | 7.6% |
| r | 38280 | 7.6% |
| e | 13010 | 2.6% |
| c | 10309 | 2.0% |
| o | 10309 | 2.0% |
| Other values (3) | 12907 | 2.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 48589 | |
| S | 43785 | |
| F | 7505 | 7.3% |
| D | 2701 | 2.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 51290 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 607431 | |
| Common | 51290 | 7.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 115541 | |
| s | 104683 | |
| d | 71859 | |
| C | 48589 | |
| l | 48589 | |
| S | 43785 | 7.2% |
| n | 41084 | 6.8% |
| t | 38280 | 6.3% |
| r | 38280 | 6.3% |
| e | 13010 | 2.1% |
| Other values (7) | 43731 | 7.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 51290 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 658721 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 115541 | |
| s | 104683 | |
| d | 71859 | |
| (space) | 51290 | |
| C | 48589 | |
| l | 48589 | |
| S | 43785 | 6.6% |
| n | 41084 | 6.2% |
| t | 38280 | 5.8% |
| r | 38280 | 5.8% |
| Other values (8) | 56741 |
Customer ID
Text
| Distinct | 1590 |
|---|---|
| Distinct (%) | 3.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 400.8 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.8146227 |
| Min length | 5 |
Unique
| Unique | 7 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | RH-19495 |
|---|---|
| 2nd row | JR-16210 |
| 3rd row | CR-12730 |
| 4th row | KM-16375 |
| 5th row | RH-9495 |
| Value | Count | Frequency (%) |
| po-18850 | 97 | 0.2% |
| be-11335 | 94 | 0.2% |
| jg-15805 | 90 | 0.2% |
| sw-20755 | 89 | 0.2% |
| em-13960 | 85 | 0.2% |
| my-18295 | 85 | 0.2% |
| mp-17965 | 84 | 0.2% |
| zc-21910 | 84 | 0.2% |
| ck-12205 | 83 | 0.2% |
| af-10870 | 81 | 0.2% |
| Other values (1580) | 50418 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 54738 | |
| - | 51290 | |
| 0 | 43087 | 10.7% |
| 5 | 39862 | 9.9% |
| 2 | 21594 | 5.4% |
| 8 | 14778 | 3.7% |
| 6 | 14703 | 3.7% |
| 7 | 14670 | 3.7% |
| 3 | 14635 | 3.7% |
| 4 | 14487 | 3.6% |
| Other values (30) | 116968 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 246942 | |
| Uppercase Letter | 102373 | |
| Dash Punctuation | 51290 | 12.8% |
| Lowercase Letter | 207 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 9011 | 8.8% |
| C | 8835 | 8.6% |
| S | 8738 | 8.5% |
| B | 8382 | 8.2% |
| D | 6582 | 6.4% |
| J | 6171 | 6.0% |
| A | 5967 | 5.8% |
| H | 5218 | 5.1% |
| P | 5206 | 5.1% |
| R | 4849 | 4.7% |
| Other values (16) | 33414 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 54738 | |
| 0 | 43087 | |
| 5 | 39862 | |
| 2 | 21594 | 8.7% |
| 8 | 14778 | 6.0% |
| 6 | 14703 | 6.0% |
| 7 | 14670 | 5.9% |
| 3 | 14635 | 5.9% |
| 4 | 14487 | 5.9% |
| 9 | 14388 | 5.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| p | 81 | |
| o | 68 | |
| l | 58 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 51290 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 298232 | |
| Latin | 102580 | 25.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 9011 | 8.8% |
| C | 8835 | 8.6% |
| S | 8738 | 8.5% |
| B | 8382 | 8.2% |
| D | 6582 | 6.4% |
| J | 6171 | 6.0% |
| A | 5967 | 5.8% |
| H | 5218 | 5.1% |
| P | 5206 | 5.1% |
| R | 4849 | 4.7% |
| Other values (19) | 33621 |
Common
| Value | Count | Frequency (%) |
| 1 | 54738 | |
| - | 51290 | |
| 0 | 43087 | |
| 5 | 39862 | |
| 2 | 21594 | 7.2% |
| 8 | 14778 | 5.0% |
| 6 | 14703 | 4.9% |
| 7 | 14670 | 4.9% |
| 3 | 14635 | 4.9% |
| 4 | 14487 | 4.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 400812 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 54738 | |
| - | 51290 | |
| 0 | 43087 | 10.7% |
| 5 | 39862 | 9.9% |
| 2 | 21594 | 5.4% |
| 8 | 14778 | 3.7% |
| 6 | 14703 | 3.7% |
| 7 | 14670 | 3.7% |
| 3 | 14635 | 3.7% |
| 4 | 14487 | 3.6% |
| Other values (30) | 116968 |
Customer Name
Text
| Distinct | 795 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 400.8 KiB |
Length
| Max length | 22 |
|---|---|
| Median length | 18 |
| Mean length | 12.946227 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Rick Hansen |
|---|---|
| 2nd row | Justin Ritter |
| 3rd row | Craig Reiter |
| 4th row | Katherine Murray |
| 5th row | Rick Hansen |
| Value | Count | Frequency (%) |
| michael | 655 | 0.6% |
| john | 522 | 0.5% |
| paul | 438 | 0.4% |
| patrick | 437 | 0.4% |
| tom | 430 | 0.4% |
| stewart | 426 | 0.4% |
| anthony | 424 | 0.4% |
| frank | 422 | 0.4% |
| alan | 402 | 0.4% |
| bill | 402 | 0.4% |
| Other values (901) | 98324 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 61455 | 9.3% |
| e | 60921 | 9.2% |
| n | 51801 | 7.8% |
| (space) | 51592 | 7.8% |
| r | 48359 | 7.3% |
| i | 40342 | 6.1% |
| l | 34229 | 5.2% |
| o | 30793 | 4.6% |
| t | 27197 | 4.1% |
| s | 23187 | 3.5% |
| Other values (47) | 234136 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 506250 | |
| Uppercase Letter | 105217 | 15.8% |
| Space Separator | 51592 | 7.8% |
| Other Punctuation | 728 | 0.1% |
| Dash Punctuation | 225 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 61455 | |
| e | 60921 | |
| n | 51801 | |
| r | 48359 | |
| i | 40342 | 8.0% |
| l | 34229 | 6.8% |
| o | 30793 | 6.1% |
| t | 27197 | 5.4% |
| s | 23187 | 4.6% |
| h | 19661 | 3.9% |
| Other values (18) | 108305 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 9426 | 9.0% |
| M | 9185 | 8.7% |
| S | 8738 | 8.3% |
| B | 8677 | 8.2% |
| D | 6780 | 6.4% |
| A | 6305 | 6.0% |
| J | 6171 | 5.9% |
| H | 5434 | 5.2% |
| P | 5206 | 4.9% |
| R | 4970 | 4.7% |
| Other values (16) | 34325 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 51592 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 728 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 225 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 611467 | |
| Common | 52545 | 7.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 61455 | 10.1% |
| e | 60921 | 10.0% |
| n | 51801 | 8.5% |
| r | 48359 | 7.9% |
| i | 40342 | 6.6% |
| l | 34229 | 5.6% |
| o | 30793 | 5.0% |
| t | 27197 | 4.4% |
| s | 23187 | 3.8% |
| h | 19661 | 3.2% |
| Other values (44) | 213522 |
Common
| Value | Count | Frequency (%) |
| (space) | 51592 | |
| ' | 728 | 1.4% |
| - | 225 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 663596 | |
| None | 416 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 61455 | 9.3% |
| e | 60921 | 9.2% |
| n | 51801 | 7.8% |
| (space) | 51592 | 7.8% |
| r | 48359 | 7.3% |
| i | 40342 | 6.1% |
| l | 34229 | 5.2% |
| o | 30793 | 4.6% |
| t | 27197 | 4.1% |
| s | 23187 | 3.5% |
| Other values (44) | 233720 |
None
| Value | Count | Frequency (%) |
| ö | 293 | |
| ä | 76 | 18.3% |
| ü | 47 | 11.3% |
Segment
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 400.8 KiB |
Length
| Max length | 11 |
|---|---|
| Median length | 8 |
| Mean length | 8.8472997 |
| Min length | 8 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Consumer |
|---|---|
| 2nd row | Corporate |
| 3rd row | Consumer |
| 4th row | Home Office |
| 5th row | Consumer |
Common Values
| Value | Count | Frequency (%) |
| Consumer | 26518 | |
| Corporate | 15429 | |
| Home Office | 9343 | 18.2% |
Words
| Value | Count | Frequency (%) |
| consumer | 26518 | |
| corporate | 15429 | |
| home | 9343 | 15.4% |
| office | 9343 | 15.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 66719 | |
| e | 60633 | |
| r | 57376 | |
| C | 41947 | |
| m | 35861 | |
| u | 26518 | 5.8% |
| s | 26518 | 5.8% |
| n | 26518 | 5.8% |
| f | 18686 | 4.1% |
| p | 15429 | 3.4% |
| Other values (7) | 77573 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 383802 | |
| Uppercase Letter | 60633 | 13.4% |
| Space Separator | 9343 | 2.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 66719 | |
| e | 60633 | |
| r | 57376 | |
| m | 35861 | |
| u | 26518 | 6.9% |
| s | 26518 | 6.9% |
| n | 26518 | 6.9% |
| f | 18686 | 4.9% |
| p | 15429 | 4.0% |
| a | 15429 | 4.0% |
| Other values (3) | 34115 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 41947 | |
| H | 9343 | 15.4% |
| O | 9343 | 15.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 9343 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 444435 | |
| Common | 9343 | 2.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 66719 | |
| e | 60633 | |
| r | 57376 | |
| C | 41947 | |
| m | 35861 | |
| u | 26518 | 6.0% |
| s | 26518 | 6.0% |
| n | 26518 | 6.0% |
| f | 18686 | 4.2% |
| p | 15429 | 3.5% |
| Other values (6) | 68230 |
Common
| Value | Count | Frequency (%) |
| (space) | 9343 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 453778 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 66719 | |
| e | 60633 | |
| r | 57376 | |
| C | 41947 | |
| m | 35861 | |
| u | 26518 | 5.8% |
| s | 26518 | 5.8% |
| n | 26518 | 5.8% |
| f | 18686 | 4.1% |
| p | 15429 | 3.4% |
| Other values (7) | 77573 |
City
Text
| Distinct | 3636 |
|---|---|
| Distinct (%) | 7.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 400.8 KiB |
Length
| Max length | 35 |
|---|---|
| Median length | 29 |
| Mean length | 8.419302 |
| Min length | 2 |
Unique
| Unique | 488 |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | New York City |
|---|---|
| 2nd row | Wollongong |
| 3rd row | Brisbane |
| 4th row | Berlin |
| 5th row | Dakar |
| Value | Count | Frequency (%) |
| city | 1789 | 2.8% |
| san | 1671 | 2.6% |
| new | 958 | 1.5% |
| york | 950 | 1.5% |
| los | 874 | 1.4% |
| angeles | 751 | 1.2% |
| de | 599 | 0.9% |
| francisco | 557 | 0.9% |
| philadelphia | 537 | 0.8% |
| santo | 465 | 0.7% |
| Other values (3806) | 54688 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 55167 | 12.8% |
| n | 32502 | 7.5% |
| e | 31845 | 7.4% |
| o | 30443 | 7.0% |
| i | 27026 | 6.3% |
| r | 23842 | 5.5% |
| l | 21403 | 5.0% |
| s | 16146 | 3.7% |
| t | 15964 | 3.7% |
| u | 15609 | 3.6% |
| Other values (66) | 161879 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 354182 | |
| Uppercase Letter | 63411 | 14.7% |
| Space Separator | 12549 | 2.9% |
| Dash Punctuation | 1318 | 0.3% |
| Other Punctuation | 356 | 0.1% |
| Open Punctuation | 4 | < 0.1% |
| Close Punctuation | 4 | < 0.1% |
| Control | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 55167 | |
| n | 32502 | 9.2% |
| e | 31845 | 9.0% |
| o | 30443 | 8.6% |
| i | 27026 | 7.6% |
| r | 23842 | 6.7% |
| l | 21403 | 6.0% |
| s | 16146 | 4.6% |
| t | 15964 | 4.5% |
| u | 15609 | 4.4% |
| Other values (31) | 84235 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 7463 | |
| C | 7203 | |
| M | 6059 | 9.6% |
| B | 4731 | 7.5% |
| L | 4204 | 6.6% |
| A | 4133 | 6.5% |
| P | 4011 | 6.3% |
| T | 2652 | 4.2% |
| D | 2554 | 4.0% |
| N | 2447 | 3.9% |
| Other values (17) | 17954 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 341 | |
| . | 8 | 2.2% |
| ? | 7 | 2.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 12549 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1318 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4 |
Control
| Value | Count | Frequency (%) |
| | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 417593 | |
| Common | 14233 | 3.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 55167 | 13.2% |
| n | 32502 | 7.8% |
| e | 31845 | 7.6% |
| o | 30443 | 7.3% |
| i | 27026 | 6.5% |
| r | 23842 | 5.7% |
| l | 21403 | 5.1% |
| s | 16146 | 3.9% |
| t | 15964 | 3.8% |
| u | 15609 | 3.7% |
| Other values (58) | 147646 |
Common
| Value | Count | Frequency (%) |
| (space) | 12549 | |
| - | 1318 | 9.3% |
| ' | 341 | 2.4% |
| . | 8 | 0.1% |
| ? | 7 | < 0.1% |
| ( | 4 | < 0.1% |
| ) | 4 | < 0.1% |
| | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 429396 | |
| None | 2430 | 0.6% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 55167 | 12.8% |
| n | 32502 | 7.6% |
| e | 31845 | 7.4% |
| o | 30443 | 7.1% |
| i | 27026 | 6.3% |
| r | 23842 | 5.6% |
| l | 21403 | 5.0% |
| s | 16146 | 3.8% |
| t | 15964 | 3.7% |
| u | 15609 | 3.6% |
| Other values (49) | 159449 |
None
| Value | Count | Frequency (%) |
| á | 645 | |
| í | 507 | |
| ó | 418 | |
| é | 290 | |
| ã | 261 | |
| ú | 89 | 3.7% |
| ü | 55 | 2.3% |
| ç | 52 | 2.1% |
| ñ | 34 | 1.4% |
| Á | 32 | 1.3% |
| Other values (7) | 47 | 1.9% |
State
Text
| Distinct | 1094 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 400.8 KiB |
Length
| Max length | 36 |
|---|---|
| Median length | 26 |
| Mean length | 9.6409826 |
| Min length | 3 |
Unique
| Unique | 64 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | New York |
|---|---|
| 2nd row | New South Wales |
| 3rd row | Queensland |
| 4th row | Berlin |
| 5th row | Dakar |
| Value | Count | Frequency (%) |
| california | 2125 | 3.1% |
| new | 2103 | 3.1% |
| england | 1499 | 2.2% |
| south | 1201 | 1.8% |
| north | 1145 | 1.7% |
| york | 1128 | 1.7% |
| texas | 985 | 1.4% |
| ile-de-france | 981 | 1.4% |
| wales | 817 | 1.2% |
| capital | 784 | 1.1% |
| Other values (1189) | 55429 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 72974 | |
| n | 39413 | 8.0% |
| i | 33410 | 6.8% |
| e | 30993 | 6.3% |
| o | 28369 | 5.7% |
| r | 28361 | 5.7% |
| l | 23890 | 4.8% |
| t | 21263 | 4.3% |
| s | 19509 | 3.9% |
| (space) | 16907 | 3.4% |
| Other values (71) | 179397 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 398796 | |
| Uppercase Letter | 71791 | 14.5% |
| Space Separator | 16907 | 3.4% |
| Dash Punctuation | 5834 | 1.2% |
| Other Punctuation | 946 | 0.2% |
| Open Punctuation | 103 | < 0.1% |
| Close Punctuation | 103 | < 0.1% |
| Control | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 72974 | |
| n | 39413 | |
| i | 33410 | 8.4% |
| e | 30993 | 7.8% |
| o | 28369 | 7.1% |
| r | 28361 | 7.1% |
| l | 23890 | 6.0% |
| t | 21263 | 5.3% |
| s | 19509 | 4.9% |
| u | 14632 | 3.7% |
| Other values (35) | 85982 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 7579 | 10.6% |
| S | 6867 | 9.6% |
| A | 5516 | 7.7% |
| N | 5001 | 7.0% |
| M | 4112 | 5.7% |
| P | 4003 | 5.6% |
| B | 3595 | 5.0% |
| T | 3083 | 4.3% |
| W | 3016 | 4.2% |
| F | 2601 | 3.6% |
| Other values (17) | 26418 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 764 | |
| ? | 135 | 14.3% |
| . | 47 | 5.0% |
Control
| Value | Count | Frequency (%) |
| | 4 | |
| | 2 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 16907 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5834 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 103 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 103 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 470587 | |
| Common | 23899 | 4.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 72974 | |
| n | 39413 | 8.4% |
| i | 33410 | 7.1% |
| e | 30993 | 6.6% |
| o | 28369 | 6.0% |
| r | 28361 | 6.0% |
| l | 23890 | 5.1% |
| t | 21263 | 4.5% |
| s | 19509 | 4.1% |
| u | 14632 | 3.1% |
| Other values (62) | 157773 |
Common
| Value | Count | Frequency (%) |
| (space) | 16907 | |
| - | 5834 | 24.4% |
| ' | 764 | 3.2% |
| ? | 135 | 0.6% |
| ( | 103 | 0.4% |
| ) | 103 | 0.4% |
| . | 47 | 0.2% |
| | 4 | < 0.1% |
| | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 489990 | |
| None | 4496 | 0.9% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 72974 | |
| n | 39413 | 8.0% |
| i | 33410 | 6.8% |
| e | 30993 | 6.3% |
| o | 28369 | 5.8% |
| r | 28361 | 5.8% |
| l | 23890 | 4.9% |
| t | 21263 | 4.3% |
| s | 19509 | 4.0% |
| (space) | 16907 | 3.5% |
| Other values (49) | 174901 |
None
| Value | Count | Frequency (%) |
| é | 935 | |
| á | 875 | |
| í | 714 | |
| ô | 672 | |
| ã | 473 | |
| ó | 291 | 6.5% |
| ü | 260 | 5.8% |
| è | 63 | 1.4% |
| à | 58 | 1.3% |
| Á | 30 | 0.7% |
| Other values (12) | 125 | 2.8% |
Country
Text
| Distinct | 147 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 400.8 KiB |
Length
| Max length | 32 |
|---|---|
| Median length | 22 |
| Mean length | 8.8366738 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | United States |
|---|---|
| 2nd row | Australia |
| 3rd row | Australia |
| 4th row | Germany |
| 5th row | Senegal |
| Value | Count | Frequency (%) |
| united | 11641 | 17.1% |
| states | 9994 | 14.7% |
| australia | 2837 | 4.2% |
| france | 2827 | 4.2% |
| mexico | 2644 | 3.9% |
| germany | 2065 | 3.0% |
| china | 1880 | 2.8% |
| kingdom | 1633 | 2.4% |
| brazil | 1599 | 2.3% |
| india | 1555 | 2.3% |
| Other values (154) | 29420 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 55137 | 12.2% |
| e | 43176 | 9.5% |
| i | 40575 | 9.0% |
| t | 40485 | 8.9% |
| n | 36881 | 8.1% |
| d | 21302 | 4.7% |
| r | 20390 | 4.5% |
| s | 18355 | 4.0% |
| (space) | 16805 | 3.7% |
| o | 14461 | 3.2% |
| Other values (44) | 145666 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 368751 | |
| Uppercase Letter | 67287 | 14.8% |
| Space Separator | 16805 | 3.7% |
| Open Punctuation | 136 | < 0.1% |
| Close Punctuation | 136 | < 0.1% |
| Other Punctuation | 109 | < 0.1% |
| Dash Punctuation | 9 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 55137 | |
| e | 43176 | |
| i | 40575 | |
| t | 40485 | |
| n | 36881 | |
| d | 21302 | 5.8% |
| r | 20390 | 5.5% |
| s | 18355 | 5.0% |
| o | 14461 | 3.9% |
| l | 13507 | 3.7% |
| Other values (16) | 64482 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 13329 | |
| U | 12131 | |
| I | 5366 | |
| A | 4822 | 7.2% |
| C | 4252 | 6.3% |
| M | 3719 | 5.5% |
| F | 2891 | 4.3% |
| G | 2795 | 4.2% |
| N | 2745 | 4.1% |
| B | 2339 | 3.5% |
| Other values (13) | 12898 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 16805 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 136 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 136 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 109 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 9 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 436038 | |
| Common | 17195 | 3.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 55137 | |
| e | 43176 | 9.9% |
| i | 40575 | 9.3% |
| t | 40485 | 9.3% |
| n | 36881 | 8.5% |
| d | 21302 | 4.9% |
| r | 20390 | 4.7% |
| s | 18355 | 4.2% |
| o | 14461 | 3.3% |
| l | 13507 | 3.1% |
| Other values (39) | 131769 |
Common
| Value | Count | Frequency (%) |
| (space) | 16805 | |
| ( | 136 | 0.8% |
| ) | 136 | 0.8% |
| ' | 109 | 0.6% |
| - | 9 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 453233 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 55137 | 12.2% |
| e | 43176 | 9.5% |
| i | 40575 | 9.0% |
| t | 40485 | 8.9% |
| n | 36881 | 8.1% |
| d | 21302 | 4.7% |
| r | 20390 | 4.5% |
| s | 18355 | 4.0% |
| (space) | 16805 | 3.7% |
| o | 14461 | 3.2% |
| Other values (44) | 145666 |
Postal Code
Real number (ℝ)
High correlation  Missing 
| Distinct | 631 |
|---|---|
| Distinct (%) | 6.3% |
| Missing | 41296 |
| Missing (%) | 80.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 55190.379 |
| Minimum | 1040 |
| Maximum | 99301 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 400.8 KiB |
Quantile statistics
| Minimum | 1040 |
|---|---|
| 5-th percentile | 10009 |
| Q1 | 23223 |
| median | 56430.5 |
| Q3 | 90008 |
| 95-th percentile | 98006 |
| Maximum | 99301 |
| Range | 98261 |
| Interquartile range (IQR) | 66785 |
Descriptive statistics
| Standard deviation | 32063.693 |
|---|---|
| Coefficient of variation (CV) | 0.58096526 |
| Kurtosis | -1.4930202 |
| Mean | 55190.379 |
| Median Absolute Deviation (MAD) | 33573.5 |
| Skewness | -0.12852552 |
| Sum | 5.5157265 × 10⁸ |
| Variance | 1.0280804 × 10⁹ |
| Monotonicity | Not monotonic |
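Because Postal Code is mostly missing, its statistics are computed over the non-missing rows only. The sketch below works through that arithmetic from the figures reported above; it is a consistency check under the assumption that Distinct (%) is taken relative to non-missing values, which is what the numbers suggest. Variable names are illustrative.

```python
# Figures copied from the Postal Code tables above.
n_observations = 51290
missing = 41296
distinct = 631
mean = 55190.379
std_dev = 32063.693

# Rows that actually carry a postal code.
present = n_observations - missing

missing_pct = 100 * missing / n_observations
# Assumption: Distinct (%) is relative to non-missing rows, not all rows.
distinct_pct = 100 * distinct / present

# Coefficient of variation and sum, both over non-missing rows.
cv = std_dev / mean
total = mean * present

print(f"present={present}, missing={missing_pct:.1f}%, distinct={distinct_pct:.1f}%")
print(f"cv={cv:.8f}, sum={total:.4e}")
```

The 9994 non-missing rows equal the US row count in the Market table, so postal codes appear to be recorded only for US orders; this would also explain the high correlation flagged between Postal Code and Market.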
| Value | Count | Frequency (%) |
| 10035 | 263 | 0.5% |
| 10024 | 230 | 0.4% |
| 10009 | 229 | 0.4% |
| 94122 | 203 | 0.4% |
| 10011 | 193 | 0.4% |
| 94110 | 166 | 0.3% |
| 98105 | 165 | 0.3% |
| 19134 | 160 | 0.3% |
| 98103 | 151 | 0.3% |
| 90049 | 151 | 0.3% |
| Other values (621) | 8083 | 15.8% |
| (Missing) | 41296 |
| Value | Count | Frequency (%) |
| 1040 | 1 | < 0.1% |
| 1453 | 6 | < 0.1% |
| 1752 | 2 | < 0.1% |
| 1810 | 4 | < 0.1% |
| 1841 | 33 | |
| 1852 | 16 | |
| 1915 | 3 | < 0.1% |
| 2038 | 17 | |
| 2138 | 6 | < 0.1% |
| 2148 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 99301 | 6 | < 0.1% |
| 99207 | 7 | < 0.1% |
| 98661 | 5 | < 0.1% |
| 98632 | 3 | < 0.1% |
| 98502 | 5 | < 0.1% |
| 98270 | 2 | < 0.1% |
| 98226 | 3 | < 0.1% |
| 98208 | 1 | < 0.1% |
| 98198 | 7 | < 0.1% |
| 98115 | 112 |
Market
Categorical
High correlation 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 400.8 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 3.6148957 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | US |
|---|---|
| 2nd row | APAC |
| 3rd row | APAC |
| 4th row | EU |
| 5th row | Africa |
Common Values
| Value | Count | Frequency (%) |
| APAC | 11002 | |
| LATAM | 10294 | |
| EU | 10000 | |
| US | 9994 | |
| EMEA | 5029 | |
| Africa | 4587 | |
| Canada | 384 | 0.7% |
Words
| Value | Count | Frequency (%) |
| apac | 11002 | |
| latam | 10294 | |
| eu | 10000 | |
| us | 9994 | |
| emea | 5029 | |
| africa | 4587 | |
| canada | 384 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 52208 | |
| E | 20058 | 10.8% |
| U | 19994 | 10.8% |
| M | 15323 | 8.3% |
| C | 11386 | 6.1% |
| P | 11002 | 5.9% |
| L | 10294 | 5.6% |
| T | 10294 | 5.6% |
| S | 9994 | 5.4% |
| a | 5739 | 3.1% |
| Other values (6) | 19116 | 10.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 160553 | |
| Lowercase Letter | 24855 | 13.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 52208 | |
| E | 20058 | 12.5% |
| U | 19994 | 12.5% |
| M | 15323 | 9.5% |
| C | 11386 | 7.1% |
| P | 11002 | 6.9% |
| L | 10294 | 6.4% |
| T | 10294 | 6.4% |
| S | 9994 | 6.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 5739 | |
| r | 4587 | |
| f | 4587 | |
| i | 4587 | |
| c | 4587 | |
| n | 384 | 1.5% |
| d | 384 | 1.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 185408 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 52208 | |
| E | 20058 | 10.8% |
| U | 19994 | 10.8% |
| M | 15323 | 8.3% |
| C | 11386 | 6.1% |
| P | 11002 | 5.9% |
| L | 10294 | 5.6% |
| T | 10294 | 5.6% |
| S | 9994 | 5.4% |
| a | 5739 | 3.1% |
| Other values (6) | 19116 | 10.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 185408 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 52208 | |
| E | 20058 | 10.8% |
| U | 19994 | 10.8% |
| M | 15323 | 8.3% |
| C | 11386 | 6.1% |
| P | 11002 | 5.9% |
| L | 10294 | 5.6% |
| T | 10294 | 5.6% |
| S | 9994 | 5.4% |
| a | 5739 | 3.1% |
| Other values (6) | 19116 | 10.3% |
Region
Categorical
High correlation 
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 400.8 KiB |
Length
| Max length | 14 |
|---|---|
| Median length | 12 |
| Mean length | 6.638643 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | East |
|---|---|
| 2nd row | Oceania |
| 3rd row | Oceania |
| 4th row | Central |
| 5th row | Africa |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Central | 11117 | 21.7% |
| South | 6645 | 13.0% |
| EMEA | 5029 | 9.8% |
| North | 4785 | 9.3% |
| Africa | 4587 | 8.9% |
| Oceania | 3487 | 6.8% |
| West | 3203 | 6.2% |
| Southeast Asia | 3129 | 6.1% |
| East | 2848 | 5.6% |
| North Asia | 2338 | 4.6% |
| Other values (3) | 4122 | 8.0% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| central | 13165 | 22.4% |
| asia | 7515 | 12.8% |
| north | 7123 | 12.1% |
| south | 6645 | 11.3% |
| emea | 5029 | 8.6% |
| africa | 4587 | 7.8% |
| oceania | 3487 | 5.9% |
| west | 3203 | 5.4% |
| southeast | 3129 | 5.3% |
| east | 2848 | 4.8% |
| Other values (2) | 2074 | 3.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 42750 | |
| t | 39242 | 11.5% |
| r | 26565 | 7.8% |
| e | 24674 | 7.2% |
| n | 18726 | 5.5% |
| i | 17279 | 5.1% |
| A | 17131 | 5.0% |
| h | 16897 | 5.0% |
| o | 16897 | 5.0% |
| s | 16695 | 4.9% |
| Other values (14) | 103640 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 259089 | |
| Uppercase Letter | 73892 | 21.7% |
| Space Separator | 7515 | 2.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 42750 | |
| t | 39242 | |
| r | 26565 | |
| e | 24674 | |
| n | 18726 | |
| i | 17279 | |
| h | 16897 | 6.5% |
| o | 16897 | 6.5% |
| s | 16695 | 6.4% |
| l | 13165 | 5.1% |
| Other values (5) | 26199 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 17131 | |
| C | 15239 | |
| E | 12906 | |
| S | 9774 | |
| N | 7123 | |
| M | 5029 | 6.8% |
| O | 3487 | 4.7% |
| W | 3203 | 4.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 7515 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 332981 | |
| Common | 7515 | 2.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 42750 | |
| t | 39242 | |
| r | 26565 | 8.0% |
| e | 24674 | 7.4% |
| n | 18726 | 5.6% |
| i | 17279 | 5.2% |
| A | 17131 | 5.1% |
| h | 16897 | 5.1% |
| o | 16897 | 5.1% |
| s | 16695 | 5.0% |
| Other values (13) | 96125 |
Common
| Value | Count | Frequency (%) |
| (space) | 7515 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 340496 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 42750 | |
| t | 39242 | 11.5% |
| r | 26565 | 7.8% |
| e | 24674 | 7.2% |
| n | 18726 | 5.5% |
| i | 17279 | 5.1% |
| A | 17131 | 5.0% |
| h | 16897 | 5.0% |
| o | 16897 | 5.0% |
| s | 16695 | 4.9% |
| Other values (14) | 103640 |
Product ID
Text
| Distinct | 10292 |
|---|---|
| Distinct (%) | 20.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 400.8 KiB |
Length
| Max length | 16 |
|---|---|
| Median length | 15 |
| Mean length | 15.195009 |
| Min length | 15 |
Unique
| Unique | 1420 |
|---|---|
| Unique (%) | 2.8% |
Sample
| 1st row | TEC-AC-10003033 |
|---|---|
| 2nd row | FUR-CH-10003950 |
| 3rd row | TEC-PH-10004664 |
| 4th row | TEC-PH-10004583 |
| 5th row | TEC-SHA-10000501 |
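The five sample values share a visible layout, and the reported lengths (minimum 15, maximum 16) fit it. The sketch below checks that layout with a regular expression. The pattern is a hypothetical one inferred from these five rows only, not a documented rule of the dataset, and the space characters counted elsewhere in this section suggest that some values do not follow it.

```python
import re

# Hypothetical pattern inferred from the sample rows only:
# three letters, then two or three letters, then eight digits, joined by hyphens.
PRODUCT_ID = re.compile(r"[A-Z]{3}-[A-Z]{2,3}-\d{8}")

def is_well_formed(product_id: str) -> bool:
    """Return True when the whole string matches the assumed layout."""
    return PRODUCT_ID.fullmatch(product_id) is not None

# Sample values copied from the table above.
samples = [
    "TEC-AC-10003033",
    "FUR-CH-10003950",
    "TEC-PH-10004664",
    "TEC-PH-10004583",
    "TEC-SHA-10000501",
]

for value in samples:
    print(value, len(value), is_well_formed(value))
```

A check like this is mainly useful for listing the rows that fail, so they can be inspected before the column is used as a key.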
Words
| Value | Count | Frequency (%) |
|---|---|---|
| tec-hp | 83 | 0.2% |
| off-ar-10003651 | 35 | 0.1% |
| off-ar-10003829 | 31 | 0.1% |
| off-bi-10002799 | 30 | 0.1% |
| off-bi-10003708 | 30 | 0.1% |
| fur-ch-10003354 | 28 | 0.1% |
| off-bi-10002570 | 27 | 0.1% |
| off-bi-10004140 | 25 | < 0.1% |
| off-bi-10001808 | 24 | < 0.1% |
| off-bi-10004632 | 24 | < 0.1% |
| Other values (10283) | 51036 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 179614 | |
| - | 102580 | |
| F | 77961 | |
| 1 | 77220 | |
| O | 37143 | 4.8% |
| 2 | 25593 | 3.3% |
| 3 | 25555 | 3.3% |
| 4 | 25148 | 3.2% |
| A | 20235 | 2.6% |
| C | 19282 | 2.5% |
| Other values (25) | 189021 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 410320 | |
| Uppercase Letter | 266369 | |
| Dash Punctuation | 102580 | 13.2% |
| Space Separator | 83 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 77961 | |
| O | 37143 | |
| A | 20235 | 7.6% |
| C | 19282 | 7.2% |
| T | 16042 | 6.0% |
| E | 15297 | 5.7% |
| U | 14862 | 5.6% |
| R | 14680 | 5.5% |
| S | 8467 | 3.2% |
| B | 8234 | 3.1% |
| Other values (13) | 34166 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 179614 | |
| 1 | 77220 | |
| 2 | 25593 | 6.2% |
| 3 | 25555 | 6.2% |
| 4 | 25148 | 6.1% |
| 5 | 16086 | 3.9% |
| 7 | 15726 | 3.8% |
| 8 | 15335 | 3.7% |
| 9 | 15242 | 3.7% |
| 6 | 14801 | 3.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 102580 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 83 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 512983 | |
| Latin | 266369 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| F | 77961 | |
| O | 37143 | |
| A | 20235 | 7.6% |
| C | 19282 | 7.2% |
| T | 16042 | 6.0% |
| E | 15297 | 5.7% |
| U | 14862 | 5.6% |
| R | 14680 | 5.5% |
| S | 8467 | 3.2% |
| B | 8234 | 3.1% |
| Other values (13) | 34166 |
Common
| Value | Count | Frequency (%) |
| 0 | 179614 | |
| - | 102580 | |
| 1 | 77220 | |
| 2 | 25593 | 5.0% |
| 3 | 25555 | 5.0% |
| 4 | 25148 | 4.9% |
| 5 | 16086 | 3.1% |
| 7 | 15726 | 3.1% |
| 8 | 15335 | 3.0% |
| 9 | 15242 | 3.0% |
| Other values (2) | 14884 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 779352 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 179614 | |
| - | 102580 | |
| F | 77961 | |
| 1 | 77220 | |
| O | 37143 | 4.8% |
| 2 | 25593 | 3.3% |
| 3 | 25555 | 3.3% |
| 4 | 25148 | 3.2% |
| A | 20235 | 2.6% |
| C | 19282 | 2.5% |
| Other values (25) | 189021 |
Category
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 400.8 KiB |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 12.856093 |
| Min length | 9 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Technology |
|---|---|
| 2nd row | Furniture |
| 3rd row | Technology |
| 4th row | Technology |
| 5th row | Technology |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Office Supplies | 31273 | 61.0% |
| Technology | 10141 | 19.8% |
| Furniture | 9876 | 19.3% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| office | 31273 | 37.9% |
| supplies | 31273 | 37.9% |
| technology | 10141 | 12.3% |
| furniture | 9876 | 12.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 82563 | |
| i | 72422 | |
| p | 62546 | |
| f | 62546 | |
| u | 51025 | 7.7% |
| c | 41414 | 6.3% |
| l | 41414 | 6.3% |
| O | 31273 | 4.7% |
| S | 31273 | 4.7% |
| (space) | 31273 | 4.7% |
| Other values (10) | 151640 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 545553 | |
| Uppercase Letter | 82563 | 12.5% |
| Space Separator | 31273 | 4.7% |
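The category names in this table correspond to Unicode general categories. As a rough cross-check, the sketch below rebuilds the three totals from the Category value counts using Python's standard `unicodedata` module. That the report uses the same classification is an assumption, although the totals agree.

```python
import unicodedata
from collections import Counter

# Value counts copied from the Category "Common Values" table above.
category_counts = {"Office Supplies": 31273, "Technology": 10141, "Furniture": 9876}

# Unicode general category code -> label used in the report.
LABELS = {"Ll": "Lowercase Letter", "Lu": "Uppercase Letter", "Zs": "Space Separator"}

char_totals = Counter()
for value, n_rows in category_counts.items():
    for ch in value:
        # Every character of the value occurs once per row holding that value.
        char_totals[unicodedata.category(ch)] += n_rows

for code, count in char_totals.most_common():
    print(LABELS.get(code, code), count)
# Lowercase Letter 545553
# Uppercase Letter 82563
# Space Separator 31273
```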
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 82563 | |
| i | 72422 | |
| p | 62546 | |
| f | 62546 | |
| u | 51025 | |
| c | 41414 | |
| l | 41414 | |
| s | 31273 | 5.7% |
| o | 20282 | 3.7% |
| n | 20017 | 3.7% |
| Other values (5) | 60051 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 31273 | |
| S | 31273 | |
| T | 10141 | 12.3% |
| F | 9876 | 12.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 31273 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 628116 | |
| Common | 31273 | 4.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 82563 | |
| i | 72422 | |
| p | 62546 | |
| f | 62546 | |
| u | 51025 | |
| c | 41414 | 6.6% |
| l | 41414 | 6.6% |
| O | 31273 | 5.0% |
| S | 31273 | 5.0% |
| s | 31273 | 5.0% |
| Other values (9) | 120367 |
Common
| Value | Count | Frequency (%) |
| (space) | 31273 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 659389 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 82563 | |
| i | 72422 | |
| p | 62546 | |
| f | 62546 | |
| u | 51025 | 7.7% |
| c | 41414 | 6.3% |
| l | 41414 | 6.3% |
| O | 31273 | 4.7% |
| S | 31273 | 4.7% |
| (space) | 31273 | 4.7% |
| Other values (10) | 151640 |
Sub-Category
Categorical
High correlation 
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 400.8 KiB |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 7.2304933 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Accessories |
|---|---|
| 2nd row | Chairs |
| 3rd row | Phones |
| 4th row | Phones |
| 5th row | Copiers |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Binders | 6152 | 12.0% |
| Storage | 5059 | 9.9% |
| Art | 4883 | 9.5% |
| Paper | 3538 | 6.9% |
| Chairs | 3434 | 6.7% |
| Phones | 3357 | 6.5% |
| Furnishings | 3170 | 6.2% |
| Accessories | 3075 | 6.0% |
| Labels | 2606 | 5.1% |
| Envelopes | 2435 | 4.7% |
| Other values (7) | 13581 | 26.5% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| binders | 6152 | 12.0% |
| storage | 5059 | 9.9% |
| art | 4883 | 9.5% |
| paper | 3538 | 6.9% |
| chairs | 3434 | 6.7% |
| phones | 3357 | 6.5% |
| furnishings | 3170 | 6.2% |
| accessories | 3075 | 6.0% |
| labels | 2606 | 5.1% |
| envelopes | 2435 | 4.7% |
| Other values (7) | 13581 | 26.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 51961 | |
| e | 47733 | |
| r | 33954 | 9.2% |
| i | 26890 | 7.3% |
| n | 23945 | 6.5% |
| a | 23570 | 6.4% |
| o | 20971 | 5.7% |
| p | 16556 | 4.5% |
| t | 12362 | 3.3% |
| c | 11802 | 3.2% |
| Other values (18) | 101108 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 319562 | |
| Uppercase Letter | 51290 | 13.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 51961 | |
| e | 47733 | |
| r | 33954 | |
| i | 26890 | |
| n | 23945 | |
| a | 23570 | |
| o | 20971 | |
| p | 16556 | 5.2% |
| t | 12362 | 3.9% |
| c | 11802 | 3.7% |
| Other values (8) | 49818 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 9713 | |
| B | 8563 | |
| S | 7484 | |
| P | 6895 | |
| C | 5657 | |
| F | 5590 | |
| L | 2606 | 5.1% |
| E | 2435 | 4.7% |
| M | 1486 | 2.9% |
| T | 861 | 1.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 370852 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 51961 | |
| e | 47733 | |
| r | 33954 | 9.2% |
| i | 26890 | 7.3% |
| n | 23945 | 6.5% |
| a | 23570 | 6.4% |
| o | 20971 | 5.7% |
| p | 16556 | 4.5% |
| t | 12362 | 3.3% |
| c | 11802 | 3.2% |
| Other values (18) | 101108 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 370852 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 51961 | |
| e | 47733 | |
| r | 33954 | 9.2% |
| i | 26890 | 7.3% |
| n | 23945 | 6.5% |
| a | 23570 | 6.4% |
| o | 20971 | 5.7% |
| p | 16556 | 4.5% |
| t | 12362 | 3.3% |
| c | 11802 | 3.2% |
| Other values (18) | 101108 |
Product Name
Text
| Distinct | 3788 |
|---|---|
| Distinct (%) | 7.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 400.8 KiB |
Length
| Max length | 127 |
|---|---|
| Median length | 89 |
| Mean length | 30.856931 |
| Min length | 5 |
Unique
| Unique | 98 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Plantronics CS510 - Over-the-Head monaural Wireless Headset System |
|---|---|
| 2nd row | Novimex Executive Leather Armchair, Black |
| 3rd row | Nokia Smart Phone, with Caller ID |
| 4th row | Motorola Smart Phone, Cordless |
| 5th row | Sharp Wireless Fax, High-Speed |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| labels | 2385 | 1.0% |
| recycled | 2291 | 1.0% |
| color | 2187 | 0.9% |
| with | 2177 | 0.9% |
| set | 2106 | 0.9% |
| blue | 2092 | 0.9% |
| durable | 2072 | 0.9% |
| black | 2055 | 0.9% |
| avery | 1920 | 0.8% |
| clear | 1893 | 0.8% |
| Other values (2826) | 210320 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 179838 | 11.4% |
| e | 154618 | 9.8% |
| a | 94421 | 6.0% |
| r | 91563 | 5.8% |
| o | 88370 | 5.6% |
| l | 79902 | 5.0% |
| i | 79392 | 5.0% |
| n | 68089 | 4.3% |
| t | 62491 | 3.9% |
| s | 60638 | 3.8% |
| Other values (75) | 623330 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1084788 | |
| Uppercase Letter | 235084 | 14.9% |
| Space Separator | 180265 | 11.4% |
| Other Punctuation | 50142 | 3.2% |
| Decimal Number | 25561 | 1.6% |
| Dash Punctuation | 6566 | 0.4% |
| Control | 86 | < 0.1% |
| Close Punctuation | 60 | < 0.1% |
| Open Punctuation | 60 | < 0.1% |
| Math Symbol | 35 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 154618 | |
| a | 94421 | 8.7% |
| r | 91563 | 8.4% |
| o | 88370 | 8.1% |
| l | 79902 | 7.4% |
| i | 79392 | 7.3% |
| n | 68089 | 6.3% |
| t | 62491 | 5.8% |
| s | 60638 | 5.6% |
| c | 43284 | 4.0% |
| Other values (18) | 262020 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 33233 | |
| C | 27670 | |
| B | 22724 | 9.7% |
| P | 18037 | 7.7% |
| E | 12943 | 5.5% |
| A | 12468 | 5.3% |
| F | 12209 | 5.2% |
| M | 10589 | 4.5% |
| R | 10364 | 4.4% |
| T | 10107 | 4.3% |
| Other values (16) | 64740 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 44416 | |
| / | 1561 | 3.1% |
| & | 1446 | 2.9% |
| " | 1300 | 2.6% |
| . | 998 | 2.0% |
| ' | 257 | 0.5% |
| # | 90 | 0.2% |
| % | 45 | 0.1% |
| * | 9 | < 0.1% |
| ! | 9 | < 0.1% |
| Other values (2) | 11 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5377 | |
| 0 | 5118 | |
| 5 | 3094 | |
| 2 | 2756 | |
| 3 | 2628 | |
| 8 | 1808 | 7.1% |
| 4 | 1725 | 6.7% |
| 9 | 1234 | 4.8% |
| 6 | 941 | 3.7% |
| 7 | 880 | 3.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 179838 | 99.8% |
| (non-ASCII space) | 427 | 0.2% |
Control
| Value | Count | Frequency (%) |
| (unprintable control character) | 67 | 77.9% |
| (unprintable control character) | 19 | 22.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6566 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 60 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 60 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 35 |
Other Number
| Value | Count | Frequency (%) |
| ¾ | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1319872 | |
| Common | 262780 | 16.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 154618 | 11.7% |
| a | 94421 | 7.2% |
| r | 91563 | 6.9% |
| o | 88370 | 6.7% |
| l | 79902 | 6.1% |
| i | 79392 | 6.0% |
| n | 68089 | 5.2% |
| t | 62491 | 4.7% |
| s | 60638 | 4.6% |
| c | 43284 | 3.3% |
| Other values (44) | 497104 |
Common
| Value | Count | Frequency (%) |
| (space) | 179838 | 68.4% |
| , | 44416 | 16.9% |
| - | 6566 | 2.5% |
| 1 | 5377 | 2.0% |
| 0 | 5118 | 1.9% |
| 5 | 3094 | 1.2% |
| 2 | 2756 | 1.0% |
| 3 | 2628 | 1.0% |
| 8 | 1808 | 0.7% |
| 4 | 1725 | 0.7% |
| Other values (21) | 9454 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1582117 | |
| None | 535 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 179838 | 11.4% |
| e | 154618 | 9.8% |
| a | 94421 | 6.0% |
| r | 91563 | 5.8% |
| o | 88370 | 5.6% |
| l | 79902 | 5.1% |
| i | 79392 | 5.0% |
| n | 68089 | 4.3% |
| t | 62491 | 3.9% |
| s | 60638 | 3.8% |
| Other values (69) | 622795 |
None
| Value | Count | Frequency (%) |
| (non-ASCII space) | 427 | 79.8% |
| (unprintable control character) | 67 | 12.5% |
| (unprintable control character) | 19 | 3.6% |
| é | 14 | 2.6% |
| ¾ | 5 | 0.9% |
| à | 3 | 0.6% |
Sales
Real number (ℝ)
High correlation 
| Distinct | 22995 |
|---|---|
| Distinct (%) | 44.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 246.49058 |
| Minimum | 0.444 |
| Maximum | 22638.48 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 400.8 KiB |
Quantile statistics
| Minimum | 0.444 |
|---|---|
| 5-th percentile | 8.8 |
| Q1 | 30.758625 |
| median | 85.053 |
| Q3 | 251.0532 |
| 95-th percentile | 1015.9556 |
| Maximum | 22638.48 |
| Range | 22638.036 |
| Interquartile range (IQR) | 220.29458 |
Descriptive statistics
| Standard deviation | 487.56536 |
|---|---|
| Coefficient of variation (CV) | 1.9780284 |
| Kurtosis | 176.7312 |
| Mean | 246.49058 |
| Median Absolute Deviation (MAD) | 67.0062 |
| Skewness | 8.13808 |
| Sum | 12642502 |
| Variance | 237719.98 |
| Monotonicity | Not monotonic |
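Several of the Sales figures above follow from others by simple arithmetic. The sketch below recomputes range, interquartile range, coefficient of variation and variance from the reported minimum, maximum, quartiles, mean and standard deviation. The formulas are the standard textbook definitions, which is an assumption about how the report derives them; small differences come from the rounding of the printed inputs.

```python
# Values copied from the Sales statistics tables above.
minimum, maximum = 0.444, 22638.48
q1, q3 = 30.758625, 251.0532
mean, std = 246.49058, 487.56536

value_range = maximum - minimum  # reported as 22638.036
iqr = q3 - q1                    # reported as 220.29458
cv = std / mean                  # reported as 1.9780284
variance = std ** 2              # reported as 237719.98

print(f"range    = {value_range:.3f}")
print(f"IQR      = {iqr:.5f}")
print(f"CV       = {cv:.7f}")
print(f"variance = {variance:.2f}")
```

A coefficient of variation near 2, together with a mean (246.49) far above the median (85.053), is consistent with the strong right skew reported for this variable.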
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 12.96 | 66 | 0.1% |
| 25.92 | 50 | 0.1% |
| 19.44 | 43 | 0.1% |
| 32.4 | 42 | 0.1% |
| 15.552 | 41 | 0.1% |
| 10.368 | 38 | 0.1% |
| 26.88 | 36 | 0.1% |
| 24 | 36 | 0.1% |
| 26.4 | 35 | 0.1% |
| 17.52 | 31 | 0.1% |
| Other values (22985) | 50872 | 99.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.444 | 1 | < 0.1% |
| 0.556 | 1 | < 0.1% |
| 0.836 | 1 | < 0.1% |
| 0.852 | 1 | < 0.1% |
| 0.876 | 1 | < 0.1% |
| 0.898 | 1 | < 0.1% |
| 0.984 | 1 | < 0.1% |
| 0.99 | 1 | < 0.1% |
| 1.044 | 1 | < 0.1% |
| 1.08 | 3 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 22638.48 | 1 | < 0.1% |
| 17499.95 | 1 | < 0.1% |
| 13999.96 | 1 | < 0.1% |
| 11199.968 | 1 | < 0.1% |
| 10499.97 | 1 | < 0.1% |
| 9892.74 | 1 | < 0.1% |
| 9449.95 | 1 | < 0.1% |
| 9099.93 | 1 | < 0.1% |
| 8749.95 | 1 | < 0.1% |
| 8399.976 | 1 | < 0.1% |
Quantity
Real number (ℝ)
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.4765451 |
| Minimum | 1 |
| Maximum | 14 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 400.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 8 |
| Maximum | 14 |
| Range | 13 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.2787663 |
|---|---|
| Coefficient of variation (CV) | 0.65546864 |
| Kurtosis | 2.2758887 |
| Mean | 3.4765451 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.3603677 |
| Sum | 178312 |
| Variance | 5.1927759 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 12748 | 24.9% |
| 3 | 9682 | 18.9% |
| 1 | 8963 | 17.5% |
| 4 | 6385 | 12.4% |
| 5 | 4882 | 9.5% |
| 6 | 3020 | 5.9% |
| 7 | 2385 | 4.7% |
| 8 | 1361 | 2.7% |
| 9 | 987 | 1.9% |
| 10 | 276 | 0.5% |
| Other values (4) | 601 | 1.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 8963 | 17.5% |
| 2 | 12748 | 24.9% |
| 3 | 9682 | 18.9% |
| 4 | 6385 | 12.4% |
| 5 | 4882 | 9.5% |
| 6 | 3020 | 5.9% |
| 7 | 2385 | 4.7% |
| 8 | 1361 | 2.7% |
| 9 | 987 | 1.9% |
| 10 | 276 | 0.5% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 14 | 186 | 0.4% |
| 13 | 83 | 0.2% |
| 12 | 176 | 0.3% |
| 11 | 156 | 0.3% |
| 10 | 276 | 0.5% |
| 9 | 987 | 1.9% |
| 8 | 1361 | 2.7% |
| 7 | 2385 | 4.7% |
| 6 | 3020 | 5.9% |
| 5 | 4882 | 9.5% |
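Because Quantity takes only 14 distinct values, the frequency tables above contain the full distribution, and the summary statistics can be rebuilt from them. The sketch below does this with plain Python. The quantile rule used here (smallest value whose cumulative count reaches the requested share) is a simplifying assumption; it happens to agree with the reported quantiles for this variable but is not necessarily the method the report uses.

```python
# Full distribution of Quantity, combined from the frequency tables above.
quantity_counts = {
    1: 8963, 2: 12748, 3: 9682, 4: 6385, 5: 4882, 6: 3020, 7: 2385,
    8: 1361, 9: 987, 10: 276, 11: 156, 12: 176, 13: 83, 14: 186,
}

n = sum(quantity_counts.values())                        # number of rows
total = sum(v * c for v, c in quantity_counts.items())   # reported Sum
mean = total / n                                         # reported Mean

def quantile(q: float) -> int:
    """Smallest value whose cumulative count reaches q * n (assumed rule)."""
    cumulative = 0
    for value in sorted(quantity_counts):
        cumulative += quantity_counts[value]
        if cumulative >= q * n:
            return value
    return max(quantity_counts)

print(n, total, round(mean, 7))
print([quantile(q) for q in (0.05, 0.25, 0.5, 0.75, 0.95)])
```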
Discount
Real number (ℝ)
High correlation  Zeros 
| Distinct | 27 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.14290755 |
| Minimum | 0 |
| Maximum | 0.85 |
| Zeros | 29009 |
| Zeros (%) | 56.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 400.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0.2 |
| 95-th percentile | 0.6 |
| Maximum | 0.85 |
| Range | 0.85 |
| Interquartile range (IQR) | 0.2 |
Descriptive statistics
| Standard deviation | 0.21227993 |
|---|---|
| Coefficient of variation (CV) | 1.4854354 |
| Kurtosis | 0.71668241 |
| Mean | 0.14290755 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.3877746 |
| Sum | 7329.728 |
| Variance | 0.045062769 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 29009 | 56.6% |
| 0.2 | 4998 | 9.7% |
| 0.1 | 4068 | 7.9% |
| 0.4 | 3177 | 6.2% |
| 0.6 | 2006 | 3.9% |
| 0.7 | 1786 | 3.5% |
| 0.5 | 1633 | 3.2% |
| 0.17 | 735 | 1.4% |
| 0.47 | 725 | 1.4% |
| 0.15 | 541 | 1.1% |
| Other values (17) | 2612 | 5.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 29009 | 56.6% |
| 0.002 | 461 | 0.9% |
| 0.07 | 150 | 0.3% |
| 0.1 | 4068 | 7.9% |
| 0.15 | 541 | 1.1% |
| 0.17 | 735 | 1.4% |
| 0.2 | 4998 | 9.7% |
| 0.202 | 41 | 0.1% |
| 0.25 | 198 | 0.4% |
| 0.27 | 388 | 0.8% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.85 | 2 | < 0.1% |
| 0.8 | 316 | 0.6% |
| 0.7 | 1786 | 3.5% |
| 0.65 | 17 | < 0.1% |
| 0.602 | 23 | < 0.1% |
| 0.6 | 2006 | 3.9% |
| 0.57 | 12 | < 0.1% |
| 0.55 | 10 | < 0.1% |
| 0.5 | 1633 | 3.2% |
| 0.47 | 725 | 1.4% |
Profit
Real number (ℝ)
High correlation  Zeros 
| Distinct | 24575 |
|---|---|
| Distinct (%) | 47.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.610982 |
| Minimum | -6599.978 |
| Maximum | 8399.976 |
| Zeros | 668 |
| Zeros (%) | 1.3% |
| Negative | 12544 |
| Negative (%) | 24.5% |
| Memory size | 400.8 KiB |
Quantile statistics
| Minimum | -6599.978 |
|---|---|
| 5-th percentile | -83.90475 |
| Q1 | 0 |
| median | 9.24 |
| Q3 | 36.81 |
| 95-th percentile | 211.5 |
| Maximum | 8399.976 |
| Range | 14999.954 |
| Interquartile range (IQR) | 36.81 |
Descriptive statistics
| Standard deviation | 174.34097 |
|---|---|
| Coefficient of variation (CV) | 6.0934983 |
| Kurtosis | 291.41109 |
| Mean | 28.610982 |
| Median Absolute Deviation (MAD) | 15.96 |
| Skewness | 4.1571885 |
| Sum | 1467457.3 |
| Variance | 30394.774 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 668 | 1.3% |
| 4.32 | 70 | 0.1% |
| 3.96 | 69 | 0.1% |
| 7.92 | 67 | 0.1% |
| 2.64 | 63 | 0.1% |
| 2.88 | 60 | 0.1% |
| 6.84 | 57 | 0.1% |
| 9 | 56 | 0.1% |
| 0.48 | 55 | 0.1% |
| 3.42 | 55 | 0.1% |
| Other values (24565) | 50070 | 97.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -6599.978 | 1 | < 0.1% |
| -4088.376 | 1 | < 0.1% |
| -3839.9904 | 1 | < 0.1% |
| -3701.8928 | 1 | < 0.1% |
| -3399.98 | 1 | < 0.1% |
| -3059.82 | 1 | < 0.1% |
| -3009.435 | 1 | < 0.1% |
| -2929.4845 | 1 | < 0.1% |
| -2750.28 | 1 | < 0.1% |
| -2639.9912 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 8399.976 | 1 | < 0.1% |
| 6719.9808 | 1 | < 0.1% |
| 5039.9856 | 1 | < 0.1% |
| 4946.37 | 1 | < 0.1% |
| 4630.4755 | 1 | < 0.1% |
| 3979.08 | 1 | < 0.1% |
| 3919.9888 | 1 | < 0.1% |
| 3177.475 | 1 | < 0.1% |
| 2939.31 | 1 | < 0.1% |
| 2817.99 | 1 | < 0.1% |
Shipping Cost
Real number (ℝ)
High correlation 
| Distinct | 10037 |
|---|---|
| Distinct (%) | 19.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.375915 |
| Minimum | 0 |
| Maximum | 933.57 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 400.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.61 |
| Q1 | 2.61 |
| median | 7.79 |
| Q3 | 24.45 |
| 95-th percentile | 111.4095 |
| Maximum | 933.57 |
| Range | 933.57 |
| Interquartile range (IQR) | 21.84 |
Descriptive statistics
| Standard deviation | 57.296804 |
|---|---|
| Coefficient of variation (CV) | 2.1723153 |
| Kurtosis | 50.020158 |
| Mean | 26.375915 |
| Median Absolute Deviation (MAD) | 6.41 |
| Skewness | 5.8632264 |
| Sum | 1352820.7 |
| Variance | 3282.9237 |
| Monotonicity | Decreasing |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0.86 | 76 | 0.1% |
| 0.71 | 75 | 0.1% |
| 1.26 | 75 | 0.1% |
| 1.36 | 74 | 0.1% |
| 0.35 | 73 | 0.1% |
| 0.94 | 71 | 0.1% |
| 1.04 | 71 | 0.1% |
| 0.79 | 71 | 0.1% |
| 0.69 | 70 | 0.1% |
| 1.3 | 70 | 0.1% |
| Other values (10027) | 50564 | 98.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 2 | < 0.1% |
| 0.01 | 6 | < 0.1% |
| 0.02 | 9 | < 0.1% |
| 0.03 | 9 | < 0.1% |
| 0.04 | 14 | < 0.1% |
| 0.05 | 19 | < 0.1% |
| 0.06 | 18 | < 0.1% |
| 0.07 | 13 | < 0.1% |
| 0.08 | 17 | < 0.1% |
| 0.09 | 23 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 933.57 | 1 | < 0.1% |
| 923.63 | 1 | < 0.1% |
| 915.49 | 1 | < 0.1% |
| 910.16 | 1 | < 0.1% |
| 903.04 | 1 | < 0.1% |
| 897.35 | 1 | < 0.1% |
| 894.77 | 1 | < 0.1% |
| 878.38 | 1 | < 0.1% |
| 867.69 | 1 | < 0.1% |
| 865.74 | 1 | < 0.1% |
Order Priority
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 400.8 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 6 |
| Mean length | 5.4070969 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Critical |
|---|---|
| 2nd row | Critical |
| 3rd row | Medium |
| 4th row | Medium |
| 5th row | Critical |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| Medium | 29433 | 57.4% |
| High | 15501 | 30.2% |
| Critical | 3932 | 7.7% |
| Low | 2424 | 4.7% |
Words
| Value | Count | Frequency (%) |
|---|---|---|
| medium | 29433 | 57.4% |
| high | 15501 | 30.2% |
| critical | 3932 | 7.7% |
| low | 2424 | 4.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 52798 | |
| M | 29433 | |
| e | 29433 | |
| d | 29433 | |
| u | 29433 | |
| m | 29433 | |
| H | 15501 | 5.6% |
| g | 15501 | 5.6% |
| h | 15501 | 5.6% |
| C | 3932 | 1.4% |
| Other values (8) | 26932 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 226040 | |
| Uppercase Letter | 51290 | 18.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 52798 | |
| e | 29433 | |
| d | 29433 | |
| u | 29433 | |
| m | 29433 | |
| g | 15501 | 6.9% |
| h | 15501 | 6.9% |
| r | 3932 | 1.7% |
| t | 3932 | 1.7% |
| c | 3932 | 1.7% |
| Other values (4) | 12712 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 29433 | |
| H | 15501 | |
| C | 3932 | 7.7% |
| L | 2424 | 4.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 277330 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 52798 | |
| M | 29433 | |
| e | 29433 | |
| d | 29433 | |
| u | 29433 | |
| m | 29433 | |
| H | 15501 | 5.6% |
| g | 15501 | 5.6% |
| h | 15501 | 5.6% |
| C | 3932 | 1.4% |
| Other values (8) | 26932 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 277330 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 52798 | |
| M | 29433 | |
| e | 29433 | |
| d | 29433 | |
| u | 29433 | |
| m | 29433 | |
| H | 15501 | 5.6% |
| g | 15501 | 5.6% |
| h | 15501 | 5.6% |
| C | 3932 | 1.4% |
| Other values (8) | 26932 |
Interactions
Correlations
| | Category | Discount | Market | Order Priority | Postal Code | Profit | Quantity | Region | Row ID | Sales | Segment | Ship Mode | Shipping Cost | Sub-Category |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Category | 1.000 | 0.185 | 0.073 | 0.003 | 0.000 | 0.056 | 0.015 | 0.056 | 0.070 | 0.062 | 0.000 | 0.000 | 0.156 | 1.000 |
| Discount | 0.185 | 1.000 | 0.348 | 0.011 | 0.053 | -0.596 | 0.018 | 0.335 | 0.017 | -0.100 | 0.000 | 0.019 | -0.094 | 0.159 |
| Market | 0.073 | 0.348 | 1.000 | 0.013 | 1.000 | 0.009 | 0.157 | 0.882 | 0.797 | 0.021 | 0.009 | 0.014 | 0.032 | 0.128 |
| Order Priority | 0.003 | 0.011 | 0.013 | 1.000 | 0.039 | 0.009 | 0.008 | 0.027 | 0.017 | 0.000 | 0.019 | 0.284 | 0.099 | 0.004 |
| Postal Code | 0.000 | 0.053 | 1.000 | 0.039 | 1.000 | -0.005 | 0.014 | 0.921 | 0.011 | -0.002 | 0.035 | 0.038 | -0.005 | 0.000 |
| Profit | 0.056 | -0.596 | 0.009 | 0.009 | -0.005 | 1.000 | 0.201 | 0.010 | -0.046 | 0.490 | 0.000 | 0.011 | 0.449 | 0.058 |
| Quantity | 0.015 | 0.018 | 0.157 | 0.008 | 0.014 | 0.201 | 1.000 | 0.129 | -0.239 | 0.416 | 0.000 | 0.008 | 0.379 | 0.016 |
| Region | 0.056 | 0.335 | 0.882 | 0.027 | 0.921 | 0.010 | 0.129 | 1.000 | 0.532 | 0.019 | 0.012 | 0.026 | 0.028 | 0.069 |
| Row ID | 0.070 | 0.017 | 0.797 | 0.017 | 0.011 | -0.046 | -0.239 | 0.532 | 1.000 | -0.143 | 0.018 | 0.012 | -0.129 | 0.100 |
| Sales | 0.062 | -0.100 | 0.021 | 0.000 | -0.002 | 0.490 | 0.416 | 0.019 | -0.143 | 1.000 | 0.000 | 0.000 | 0.913 | 0.061 |
| Segment | 0.000 | 0.000 | 0.009 | 0.019 | 0.035 | 0.000 | 0.000 | 0.012 | 0.018 | 0.000 | 1.000 | 0.011 | 0.000 | 0.012 |
| Ship Mode | 0.000 | 0.019 | 0.014 | 0.284 | 0.038 | 0.011 | 0.008 | 0.026 | 0.012 | 0.000 | 0.011 | 1.000 | 0.074 | 0.011 |
| Shipping Cost | 0.156 | -0.094 | 0.032 | 0.099 | -0.005 | 0.449 | 0.379 | 0.028 | -0.129 | 0.913 | 0.000 | 0.074 | 1.000 | 0.112 |
| Sub-Category | 1.000 | 0.159 | 0.128 | 0.004 | 0.000 | 0.058 | 0.016 | 0.069 | 0.100 | 0.061 | 0.012 | 0.011 | 0.112 | 1.000 |
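The "High correlation" labels on individual variables can be related back to this matrix. The sketch below takes the coefficients for the five numeric measures and lists the pairs whose absolute value reaches a cut-off. The cut-off of 0.5 is an assumption chosen because it reproduces the labels shown in this report (Discount with Profit, Sales with Shipping Cost, and none for Quantity); the report itself does not state the threshold.

```python
# Coefficients copied from the correlation matrix above (numeric measures only).
corr = {
    ("Discount", "Profit"): -0.596,
    ("Discount", "Quantity"): 0.018,
    ("Discount", "Sales"): -0.100,
    ("Discount", "Shipping Cost"): -0.094,
    ("Profit", "Quantity"): 0.201,
    ("Profit", "Sales"): 0.490,
    ("Profit", "Shipping Cost"): 0.449,
    ("Quantity", "Sales"): 0.416,
    ("Quantity", "Shipping Cost"): 0.379,
    ("Sales", "Shipping Cost"): 0.913,
}

THRESHOLD = 0.5  # assumed cut-off, not stated in the report

# Keep pairs whose absolute coefficient reaches the threshold.
flagged = sorted(pair for pair, r in corr.items() if abs(r) >= THRESHOLD)

for a, b in flagged:
    print(f"{a} / {b}: {corr[(a, b)]:+.3f}")
```

Profit and Sales (0.490) fall just under the assumed cut-off, which matches the absence of a label for that pair.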
Missing values
Sample
First rows
| | Row ID | Order ID | Order Date | Ship Date | Ship Mode | Customer ID | Customer Name | Segment | City | State | Country | Postal Code | Market | Region | Product ID | Category | Sub-Category | Product Name | Sales | Quantity | Discount | Profit | Shipping Cost | Order Priority |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 32298 | CA-2012-124891 | 31-07-2012 | 31-07-2012 | Same Day | RH-19495 | Rick Hansen | Consumer | New York City | New York | United States | 10024.0 | US | East | TEC-AC-10003033 | Technology | Accessories | Plantronics CS510 - Over-the-Head monaural Wireless Headset System | 2309.650 | 7 | 0.0 | 762.1845 | 933.57 | Critical |
| 1 | 26341 | IN-2013-77878 | 05-02-2013 | 07-02-2013 | Second Class | JR-16210 | Justin Ritter | Corporate | Wollongong | New South Wales | Australia | NaN | APAC | Oceania | FUR-CH-10003950 | Furniture | Chairs | Novimex Executive Leather Armchair, Black | 3709.395 | 9 | 0.1 | -288.7650 | 923.63 | Critical |
| 2 | 25330 | IN-2013-71249 | 17-10-2013 | 18-10-2013 | First Class | CR-12730 | Craig Reiter | Consumer | Brisbane | Queensland | Australia | NaN | APAC | Oceania | TEC-PH-10004664 | Technology | Phones | Nokia Smart Phone, with Caller ID | 5175.171 | 9 | 0.1 | 919.9710 | 915.49 | Medium |
| 3 | 13524 | ES-2013-1579342 | 28-01-2013 | 30-01-2013 | First Class | KM-16375 | Katherine Murray | Home Office | Berlin | Berlin | Germany | NaN | EU | Central | TEC-PH-10004583 | Technology | Phones | Motorola Smart Phone, Cordless | 2892.510 | 5 | 0.1 | -96.5400 | 910.16 | Medium |
| 4 | 47221 | SG-2013-4320 | 05-11-2013 | 06-11-2013 | Same Day | RH-9495 | Rick Hansen | Consumer | Dakar | Dakar | Senegal | NaN | Africa | Africa | TEC-SHA-10000501 | Technology | Copiers | Sharp Wireless Fax, High-Speed | 2832.960 | 8 | 0.0 | 311.5200 | 903.04 | Critical |
| 5 | 22732 | IN-2013-42360 | 28-06-2013 | 01-07-2013 | Second Class | JM-15655 | Jim Mitchum | Corporate | Sydney | New South Wales | Australia | NaN | APAC | Oceania | TEC-PH-10000030 | Technology | Phones | Samsung Smart Phone, with Caller ID | 2862.675 | 5 | 0.1 | 763.2750 | 897.35 | Critical |
| 6 | 30570 | IN-2011-81826 | 07-11-2011 | 09-11-2011 | First Class | TS-21340 | Toby Swindell | Consumer | Porirua | Wellington | New Zealand | NaN | APAC | Oceania | FUR-CH-10004050 | Furniture | Chairs | Novimex Executive Leather Armchair, Adjustable | 1822.080 | 4 | 0.0 | 564.8400 | 894.77 | Critical |
| 7 | 31192 | IN-2012-86369 | 14-04-2012 | 18-04-2012 | Standard Class | MB-18085 | Mick Brown | Consumer | Hamilton | Waikato | New Zealand | NaN | APAC | Oceania | FUR-TA-10002958 | Furniture | Tables | Chromcraft Conference Table, Fully Assembled | 5244.840 | 6 | 0.0 | 996.4800 | 878.38 | High |
| 8 | 40155 | CA-2014-135909 | 14-10-2014 | 21-10-2014 | Standard Class | JW-15220 | Jane Waco | Corporate | Sacramento | California | United States | 95823.0 | US | West | OFF-BI-10003527 | Office Supplies | Binders | Fellowes PB500 Electric Punch Plastic Comb Binding Machine with Manual Bind | 5083.960 | 5 | 0.2 | 1906.4850 | 867.69 | Low |
| 9 | 40936 | CA-2012-116638 | 28-01-2012 | 31-01-2012 | Second Class | JH-15985 | Joseph Holt | Consumer | Concord | North Carolina | United States | 28027.0 | US | South | FUR-TA-10000198 | Furniture | Tables | Chromcraft Bull-Nose Wood Oval Conference Tables & Bases | 4297.644 | 13 | 0.4 | -1862.3124 | 865.74 | Critical |
| | Row ID | Order ID | Order Date | Ship Date | Ship Mode | Customer ID | Customer Name | Segment | City | State | Country | Postal Code | Market | Region | Product ID | Category | Sub-Category | Product Name | Sales | Quantity | Discount | Profit | Shipping Cost | Order Priority |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 51280 | 46582 | TU-2014-6730 | 29-11-2014 | 30-11-2014 | First Class | KF-6285 | Karen Ferguson | Home Office | Midyat | Mardin | Turkey | NaN | EMEA | EMEA | OFF-BOS-10000350 | Office Supplies | Art | Boston Pens, Blue | 34.128 | 6 | 0.6 | -49.5720 | 0.02 | Medium |
| 51281 | 6039 | MX-2014-169530 | 09-06-2014 | 11-06-2014 | First Class | HG-15025 | Hunter Glantz | Consumer | Bragança Paulista | São Paulo | Brazil | NaN | LATAM | South | OFF-PA-10002418 | Office Supplies | Paper | Green Bar Message Books, Multicolor | 84.000 | 5 | 0.0 | 9.2000 | 0.02 | High |
| 51282 | 9922 | MX-2012-100258 | 28-12-2012 | 31-12-2012 | First Class | KM-16375 | Katherine Murray | Home Office | Managua | Managua | Nicaragua | NaN | LATAM | Central | OFF-PA-10004020 | Office Supplies | Paper | SanDisk Message Books, 8.5 x 11 | 18.640 | 1 | 0.0 | 8.0000 | 0.01 | Medium |
| 51283 | 24105 | IN-2014-72327 | 30-05-2014 | 30-05-2014 | Same Day | KH-16330 | Katharine Harms | Corporate | Lucknow | Uttar Pradesh | India | NaN | APAC | Central Asia | OFF-PA-10000215 | Office Supplies | Paper | Eaton Parchment Paper, Premium | 26.940 | 2 | 0.0 | 1.8600 | 0.01 | High |
| 51284 | 24175 | IN-2014-57662 | 05-08-2014 | 10-08-2014 | Standard Class | DB-13270 | Deborah Brumfield | Home Office | Townsville | Queensland | Australia | NaN | APAC | Oceania | OFF-BI-10002424 | Office Supplies | Binders | Avery Binder, Economy | 58.050 | 5 | 0.1 | 19.9500 | 0.01 | Medium |
| 51285 | 29002 | IN-2014-62366 | 19-06-2014 | 19-06-2014 | Same Day | KE-16420 | Katrina Edelman | Corporate | Kure | Hiroshima | Japan | NaN | APAC | North Asia | OFF-FA-10000746 | Office Supplies | Fasteners | Advantus Thumb Tacks, 12 Pack | 65.100 | 5 | 0.0 | 4.5000 | 0.01 | Medium |
| 51286 | 35398 | US-2014-102288 | 20-06-2014 | 24-06-2014 | Standard Class | ZC-21910 | Zuschuss Carroll | Consumer | Houston | Texas | United States | 77095.0 | US | Central | OFF-AP-10002906 | Office Supplies | Appliances | Hoover Replacement Belt for Commercial Guardsman Heavy-Duty Upright Vacuum | 0.444 | 1 | 0.8 | -1.1100 | 0.01 | Medium |
| 51287 | 40470 | US-2013-155768 | 02-12-2013 | 02-12-2013 | Same Day | LB-16795 | Laurel Beltran | Home Office | Oxnard | California | United States | 93030.0 | US | West | OFF-EN-10001219 | Office Supplies | Envelopes | #10- 4 1/8" x 9 1/2" Security-Tint Envelopes | 22.920 | 3 | 0.0 | 11.2308 | 0.01 | High |
| 51288 | 9596 | MX-2012-140767 | 18-02-2012 | 22-02-2012 | Standard Class | RB-19795 | Ross Baird | Home Office | Valinhos | São Paulo | Brazil | NaN | LATAM | South | OFF-BI-10000806 | Office Supplies | Binders | Acco Index Tab, Economy | 13.440 | 2 | 0.0 | 2.4000 | 0.00 | Medium |
| 51289 | 6147 | MX-2012-134460 | 22-05-2012 | 26-05-2012 | Second Class | MC-18100 | Mick Crebagga | Consumer | Tipitapa | Managua | Nicaragua | NaN | LATAM | Central | OFF-PA-10004155 | Office Supplies | Paper | Eaton Computer Printout Paper, 8.5 x 11 | 61.380 | 3 | 0.0 | 1.8000 | 0.00 | High |